
HUMAN ESTROGEN RECEPTOR-BETA 

Field of the Invention 

This invention pertains to DNA encoding a novel human estrogen
receptor-β (hERβ), hERβ polypeptides, and methods for expressing and
isolating hERβ. The invention also pertains to methods for using hERβ to
identify coactivators and inhibitors as well as tissue-specific estrogens.

Background of the Invention 

The physiological response to steroid hormones is mediated by
specific interactions of steroids with nuclear receptors, which are
ligand-activated transcription factors that regulate the expression of
target genes by binding to specific DNA response elements. These receptors
comprise (in an aminoterminal-to-carboxyterminal direction) a hypervariable
aminoterminal domain that contributes to the transactivation function; a
highly conserved DNA-binding domain responsible for receptor dimerization
and specific DNA binding; and a carboxyterminal domain involved in
ligand-binding, nuclear localization, and ligand-dependent transactivation.

Recently, cDNA was cloned from rat prostate and was shown to
have significant homology to a previously isolated rat estrogen receptor
cDNA. Kuiper et al., Proc.Natl.Acad.Sci.USA 93:5925, 1996. This
receptor was designated ERβ to distinguish it from a previously cloned
receptor, ERα. Rat ERβ was shown to be expressed in the prostate, testes,
ovary, and thymus, in contrast to ERα, which is most highly expressed in
the uterus, breast, liver, and pituitary.

A human ERβ homologue having the aminoterminal sequence
Gly-Tyr-Ser has been reported. Mosselman et al., FEBS Letts. 392:49, 1996.
This reported sequence lacks an initiator methionine, however; therefore,
the complete aminoterminal sequence could not be determined. Thus, the
full-length human gene remained unknown and an accurate picture of the
molecular determinants of the transactivation function of authentic hERβ
could not be obtained.

Summary of the Invention 

The present invention provides nucleic acids encoding a full-length
human estrogen receptor-β (hERβL). The nucleic acid sequence of hERβL,
which is depicted in Figure 3, SEQ ID NO:1, encodes a receptor having the
amino acid sequence depicted in Figure 4, SEQ ID NO:2. hERβL according
to the present invention contains 45 amino acids at its aminoterminus
which were not previously known. These amino acids are believed to
contribute to the transcription activation function of the receptor.

hERβL is selectively expressed in the thymus, spleen, ovary, and
testes. Accordingly, hERβ can be used to identify co-activator proteins that
are involved in estrogen-regulated gene expression, as well as to identify
tissue-selective estrogens.

The present invention provides isolated polypeptides having the
sequence of SEQ ID NO:2 and function-conservative variants thereof which
exhibit estrogen-regulated transcriptional activation activity. In a related
aspect, the invention encompasses isolated peptides derived from hERβ
comprising a sequence corresponding to amino acids 1-45 of SEQ ID NO:2
and function-conservative variants thereof, as explained above. It is
believed that this sequence provides at least part of the transactivation
function.

The present invention also provides isolated nucleic acids encoding
hERβL and hERβL-derived peptides, including the nucleic acid sequence
depicted in Figure 3, SEQ ID NO:1 and subfragments thereof encoding
peptides which comprise amino acids 1-45, as well as
sequence-conservative and function-conservative variants thereof. Also
encompassed by the invention are DNA vectors comprising an
hERβL-encoding sequence operably linked to a transcription regulatory
element and cells comprising these vectors. Methods for producing
hERβL-derived polypeptides include incubating a cell comprising an
hERβL-encoding expression vector under conditions that permit expression
of one or more hERβ polypeptides. The methods further include: (a)
harvesting the cells to produce a cell fraction and a medium fraction; and
(b) recovering the polypeptide(s) from the cell fraction, medium fraction,
or both.

In another aspect, the invention provides methods for identifying
hERβ-interactive compounds, including agonists, antagonists, and
co-activator proteins. In one embodiment, the method includes:

(a) contacting purified hERβ with a labeled ligand in the
presence of test compounds, to form test reactions, and in the absence of
test compounds, to form control reactions;

(b) incubating the test and control reactions under
appropriate conditions to achieve specific binding of the labeled ligand to
hERβ;

(c) determining the level of binding of the labeled ligand to
hERβ in said test and control reactions; and

(d) identifying as a hERβ-interactive compound any
compound that reduces the binding of the labeled ligand to hERβ.
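
Steps (c) and (d) above can be sketched as a short calculation. The following is only an illustrative sketch: the function names, the 50% cut-off, the optional nonspecific-binding correction, and the count values are all assumptions introduced here, since the text itself only requires that a compound reduce binding relative to the control reactions.

```python
def percent_inhibition(test_bound, control_bound, nonspecific=0.0):
    """Percent reduction in specific binding of labeled ligand versus control."""
    specific_control = control_bound - nonspecific
    if specific_control <= 0:
        raise ValueError("control reaction shows no specific binding")
    specific_test = test_bound - nonspecific
    return 100.0 * (1.0 - specific_test / specific_control)


def find_interactive(readings, control_bound, threshold=50.0, nonspecific=0.0):
    """Return names of test compounds whose inhibition meets the threshold.

    The threshold is a hypothetical screening cut-off, not a value from the text.
    """
    hits = []
    for name, bound in readings.items():
        if percent_inhibition(bound, control_bound, nonspecific) >= threshold:
            hits.append(name)
    return hits


# Hypothetical bound-ligand counts for two test reactions and one control.
readings = {"compound_A": 2000.0, "compound_B": 9500.0}
hits = find_interactive(readings, control_bound=10000.0)
print(hits)
```

In practice the cut-off would be chosen from the assay's own variability; the fixed value here only keeps the sketch short.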



Brief Description of the Drawings

Figure 1 is a schematic illustration of the oligonucleotides used for
PCR amplification of human estrogen receptor-β (hERβ) cDNA (SEQ ID
NO:5 and SEQ ID NO:6).

Figure 2 is a schematic illustration of the pcDNA3 plasmid
containing hERβL cDNA.

Figure 3 is an illustration of the full-length cDNA sequence encoding
human estrogen receptor-β (hERβL) (SEQ ID NO:1).

Figure 4 is an illustration of the predicted amino acid sequence of
the hERβ polypeptide (SEQ ID NO:2). The first 45 (previously unknown)
amino acids are underlined.

Figure 5 is a photographic illustration of an autoradiogram of a 10%
SDS-polyacrylamide gel in which hERβ in vitro translation products are
resolved. Lane 1, hERβT; lane 2, hERβL produced from a vector encoding a
synthetic translation initiation site; lane 3, hERβL produced from a vector
encoding the natural hERβ translation initiation sequences.

Figure 6A is a graphic illustration of the transcriptional activation
capacity of full-length hERβ (hERβL) and truncated hERβ (hERβT) expressed
in HepG2 cells. Cells were transfected with either hERβL or hERβT and
co-transfected with a luciferase reporter plasmid containing an estrogen
response element (ERE) (ERE.TK.LUC) and a control β-galactosidase
plasmid. Cells were incubated in the absence or presence of estradiol, after
which luciferase activity was measured and normalized to β-galactosidase
activity.

Figure 6B is a graphic illustration of luciferase activity in HepG2 cells
transfected with either hERβL or hERβT and co-transfected with a luciferase
reporter plasmid lacking an ERE (TK.LUC).

Figure 7A is a graphic illustration of the effect of estradiol
stimulation of full-length hERβ (hERβL) and truncated hERβ (hERβT) on NFκB
activation in HepG2 cells. Cells were transfected with either hERβL or
hERβT and co-transfected with a luciferase reporter plasmid containing
three copies of an NFκB binding site (3X-NFκB TK.LUC) and a control
β-galactosidase plasmid. Cells were stimulated with interleukin-1β and
incubated in the absence or presence of estradiol, after which luciferase
activity was measured and normalized to β-galactosidase activity.

Figure 7B is a graphic illustration of luciferase activity in cells
transfected with either hERβL or hERβT and co-transfected with a luciferase
reporter plasmid lacking an NFκB binding site (TK.LUC).

Figure 8 is a graphic illustration of the transcriptional activation
capacity of full-length hERβ (hERβL) and truncated hERβ (hERβT) expressed
in HAECT-1 human endothelial cells. Cells were transfected with either
hERβL or hERβT and co-transfected with a luciferase reporter plasmid
containing an ERE (ERE.TK.LUC) or one lacking an ERE (TK.LUC). Cells were
incubated in the absence or presence of estradiol, after which luciferase
activity was measured. ERE.TK.LUC values were normalized to TK.LUC
values and are presented as mean ± S.E. (n = 4).

Figure 9 is a graphic illustration of the effect of increasing doses of
estrogens (17-β estradiol or genistein) on the transcriptional activation
capacity of full-length hERβ (hERβL) and truncated hERβ (hERβT) expressed
in S. cerevisiae. Cells were transformed with either hERβL or hERβT and
co-transformed with a β-galactosidase reporter plasmid containing an ERE.
Transformed cells were treated with estrogens for 3 h and assayed for
β-galactosidase activity.

Detailed Description of the Invention 

Human estrogen receptor-β (hereinafter, hERβ) comprises
aminoterminal amino acid residues not previously known. The present
invention encompasses isolated, purified, nucleic acids encoding authentic
full-length hERβ of 530 amino acid residues and fragments thereof which
include nucleic acids encoding amino acids 1-45 of hERβ. The invention
also encompasses isolated, purified, polypeptides comprising hERβ and
peptides derived therefrom, particularly peptides which include residues
1-45 of hERβ. The invention also provides expression systems in which
transcriptionally active hERβ or fragments derived therefrom can be
produced, as well as screening methods for identifying hERβ agonists and
antagonists (including tissue-specific estrogens and anti-estrogens) as well
as hERβ co-activators and inhibitors.

Isolation and Characterization of the Gene Encoding hERβ

The present inventors have isolated the cDNA encoding hERβ using
the methods outlined below. Human testis poly A+ RNA (1 µg, Clontech,
Palo Alto CA) was mixed with 0.5 µg oligo dT primer (GIBCO-BRL,
Gaithersburg MD) in a total volume of 10 µl. The mixture was heated at
70°C for 10 minutes, and, after cooling on ice, was supplemented with
500 µM of each deoxynucleoside triphosphate, 1X cDNA synthesis buffer,
and 10 mM DTT to a final reaction volume of 20 µl. The mixture was
incubated at 42°C for 2.5 minutes and then supplemented with 1-2 units
reverse transcriptase (GIBCO-BRL, Gaithersburg MD), after which it was
incubated at 45°C for 30 minutes and 50°C for 5 minutes. One-tenth of
this mixture (approximately 2 µl) containing the cDNA template was then
used in PCR amplification of hERβ using forward and reverse primers as
described below.

Alignment of the known rat ERβ sequence (Kuiper et al.,
Proc.Natl.Acad.Sci.USA 93:5925, 1996) with that of a human homologue
(Mosselman et al., FEBS Letts. 392:49, 1996) suggested that the human
sequence lacked at least the ultimate and penultimate residues at its
aminoterminus, as shown below:

     Rat:    MTFYSPAVMNYS . . . (SEQ ID NO:3)
     Human:  --GYSPAVMNYS . . . (SEQ ID NO:4)

Based on this information, PCR primers were designed that supplement the
human sequence with the two missing aminoterminal residues M and T and
with an artificial Kozak translation initiation sequence. The forward primer,
having the sequence (SEQ ID NO:5)

     5'-GGAAGCTTGTCGACCATCATGACCGGCTATAGCCCTGCTGTGATG-3'

and a reverse primer, having the sequence (SEQ ID NO:6)

     5'-GGATCTAGAGTCGACGCGTCACTGAGACTGAGGGTTCTGG-3'

were used to amplify hERβ sequences in a reaction containing the following
components: 2 µl of the cDNA template described above; 1X PCR buffer;
200 µM of each deoxynucleoside triphosphate; 2 units of Hot Tub
polymerase (Amersham, Arlington Heights IL); and 1 µg of each of the
forward and reverse primers. The reaction mixture was heated to 95°C for
2 minutes, annealed at 52°C for 1 minute, and amplified using 36 cycles of
72°C for 1.5 minutes.
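
The design of the two primers can be checked with a few lines of code. The sketch below is an illustration added here, not part of the described procedure: it builds the standard genetic code table, confirms that the forward primer carries a HindIII site (AAGCTT) and the reverse primer an XbaI site (TCTAGA), and translates the forward primer from its first ATG. The variable and function names are assumptions.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons of a DNA string, reading from position 0."""
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, len(dna) - 2, 3))


FORWARD_PRIMER = "GGAAGCTTGTCGACCATCATGACCGGCTATAGCCCTGCTGTGATG"  # SEQ ID NO:5
REVERSE_PRIMER = "GGATCTAGAGTCGACGCGTCACTGAGACTGAGGGTTCTGG"       # SEQ ID NO:6
HINDIII_SITE = "AAGCTT"
XBAI_SITE = "TCTAGA"

has_hindiii = HINDIII_SITE in FORWARD_PRIMER
has_xbai = XBAI_SITE in REVERSE_PRIMER

# Translate the forward primer starting at its first ATG.
start = FORWARD_PRIMER.find("ATG")
encoded = translate(FORWARD_PRIMER[start:])
print(has_hindiii, has_xbai, encoded)
```

The translated residues begin with the added M and T, followed by the G-Y-S-P-A-V-M residues of the reported human sequence.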
A fragment of approximately 1500 bp in length was produced. The
fragment was digested with HindIII and XbaI (which cleave at sites present
in the forward and reverse primer sequences, respectively, but not in the
main body of the amplified cDNA sequence) and cloned into the corresponding
sites of the pcDNA3 expression vector (Invitrogen, Carlsbad CA). This
asymmetric cloning strategy places the 5' end of hERβ cDNA under the
control of the viral CMV promoter in pcDNA3 (Figures 1 and 2). Several
insert-containing pcDNA3 clones were identified. Plasmid DNA was
prepared from three clones using a plasmid purification kit (Qiagen, Santa
Clarita CA) and their insert sequences were determined by the dideoxy
termination method. One clone (designated R6TOTO-2.24 or Clone 3) was
found to contain an insert with a nucleotide sequence identical to the
published hERβ sequence (Mosselman et al., FEBS Letts. 392:49, 1996)
and had the following 5' end structure:

           M   T   G   Y  . . . (SEQ ID NO:7)
     CCATC ATG ACC GGC TAT . . . (SEQ ID NO:8)

This clone was designated "truncated hERβ" or hERβT.

To verify the aminoterminal and upstream sequence of human hERβ,
two independent approaches were taken, as described below.

(1) 10 µl of a human ovary 5'-Stretch cDNA library
(Clontech, Palo Alto CA) was mixed with 50 µl of 1X K solution (1X PCR
Buffer (GIBCO-BRL, Gaithersburg MD), 2.5 mM MgCl2, 0.5% Tween-20,
100 µg/ml Proteinase K), and the reaction mixture was incubated at 56°C
for 2 hours, then at 99°C for 10 minutes. 5 µl of this reaction mixture
were then used as template in a nested PCR reaction. For the first round,
the forward primer (pDR2 sequencing primer, Clontech, Palo Alto CA) had
the sequence 5'-CTGGTAAGTTTAGTCTTTTTGTC-3' (SEQ ID NO:9), and
the reverse primer (hERβ-specific, designated oligo #12908) had the
sequence 5'-GCTTCACACCAAGGACTCTTTTGAG-3' (SEQ ID NO:10).
The reaction contained 1X Klentaq PCR reaction buffer (40 mM
Tricine-KOH, 15 mM KOAc, 3.5 mM Mg(OAc)2, 75 µg/ml bovine serum albumin);
0.2 mM of each dNTP; 0.2 µM of each of the above primers; and 1 unit of
Klentaq Polymerase Mix (Clontech, Palo Alto CA). Touchdown PCR
conditions were as follows: 5 cycles of 94°C for 2 seconds and 72°C for 4
minutes, followed by 30 cycles of 94°C for 2 seconds and 67°C for 3
minutes.

Excess nucleotides and primers were removed from first-round PCR
reactions by purification over Wizard PCR columns (Promega, Madison WI).
A second-round PCR reaction was then performed using 2 µl of the purified
first-round reaction mixture. For the second round, the forward primer was
the pDR2 sequencing primer shown above, and the reverse primer had the
sequence 5'-GTTGGCCACAACACATTTGGGCTTGT-3' (hERβ-specific,
designated oligo #13871) (SEQ ID NO:11). The PCR reaction and cycling
conditions were identical to those employed in the first round. The
products were cloned into pCR2.1 (Invitrogen) and three resulting clones
were sequenced. All three clones (designated L1, L2, and L3) contained
hERβ inserts of different lengths, all of which were homologous to hERβ
and to each other.

(2) A Marathon-Ready thymus cDNA kit (Clontech) for 5'
rapid amplification of cDNA ends (RACE) was also used to isolate hERβ 5'
clones. In the first round of a nested PCR reaction, 5 µl of human thymus
Marathon-Ready cDNA (Clontech) was used as template. The forward
primer had the sequence 5'-CCATCCTAATACGACTCACTATAGGGC-3'
(Adaptor primer 1, Clontech) (SEQ ID NO:12), and the reverse primer had
the sequence 5'-GCTTCACACCAAGGACTCTTTTGAG-3' (hERβ-specific,
designated oligo #12908) (SEQ ID NO:10). The PCR reaction and cycling
conditions were identical to those described in (1) above.

Excess nucleotides and primers were removed from the first-round
PCR reactions by purification over Wizard PCR columns (Promega). A
second-round PCR reaction was performed using 2 µl of the purified
first-round reaction. For the second round, the forward primer had the
sequence 5'-ACTCACTATAGGGCTCGAGCGGC-3' (nested adaptor primer 2,
Clontech) (SEQ ID NO:13), and the reverse primer had the sequence
5'-GTTGGCCACAACACATTTGGGCTTGT-3' (hERβ-specific, designated oligo
#13871) (SEQ ID NO:11). The second-round PCR reaction and cycling
conditions were identical to those employed in the first round. The
products were cloned into the pCR2.1 vector and two clones were
sequenced. The two clones contain insert sequences of different lengths
that are homologous to hERβ, to each other, and to the sequences isolated
from a human ovary cDNA library as described above.

All of the hERβ sequences isolated by methods (1) and (2) above
contained 110 nucleotides corresponding to hERβT sequences, as well as
228 additional nucleotides at the 5' end (Figure 3).

The hERβ cDNA sequence determined from these clones contained
several important differences from the previously known human sequence.
First, the third amino acid of the previous sequence was found to be F and
not G (see above). Second, the methionine residue at the aminoterminus of
the previous sequence was found not to be the initiator (i.e., true
aminoterminal) residue. Rather, the authentic full-length hERβ cDNA
sequence encodes a polypeptide having 530 residues, the first 45 of which
are not found in the previously known human sequence (Figure 4). The
sequence appears to be quite homologous to rat ERβ; however, this reading
frame was not identified previously (Kuiper et al., Proc.Natl.Acad.Sci.USA
93:5925, 1996). Furthermore, an optimal Kozak translation initiation
sequence is found upstream of the newly discovered initiator methionine
codon. A termination codon was identified 63 nucleotides upstream of the
authentic ATG initiator codon in the same reading frame.

The cDNA encoding authentic full-length hERβ was cloned into
pcDNA3 under the control of the CMV promoter; this expression vector
was designated "long hERβ" or hERβL.

Synthesis of full-length hERβ and truncated hERβ

To examine the natural start site for translation of hERβ, three
plasmids were subjected to coupled transcription-translation, encoding
hERβT (with a synthetic upstream translation initiation sequence), hERβL
(with a synthetic upstream translation initiation sequence), and hERβL
containing 93 nucleotides of its native upstream sequence (the entire
sequence shown in Figure 3). The plasmids were transcribed and translated
using the TNT T7 Coupled Reticulocyte Lysate System (Promega #L4610).
Circular plasmid DNA was purified using Qiagen Maxi-Kit #12362. 2 µg of
the DNA was transcribed and translated in a single reaction in the presence
of [35S]-methionine (New England Nuclear, Boston MA). The translation
products were resolved on a 10% SDS polyacrylamide gel and were
visualized by autoradiography (Figure 5).

The resulting translation products of both hERβL plasmids were of
similar size (~63 kDa), and the hERβT product was appropriately shorter
(~56 kDa). This indicates that the initiator ATG most likely utilized in vivo
is the ATG at position 94-96. Utilization of a further upstream ATG is
unlikely because of a termination codon in-frame with the presumed start
site. Confirmation of the authentic start site is achieved by subjecting hERβ
polypeptides to aminoterminal sequencing.
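
The nucleotide positions quoted above can be cross-checked with simple arithmetic. The sketch below rests on one interpretive assumption that the text does not state outright: that the 228 additional 5' nucleotides consist of the 93 native upstream nucleotides plus the codons for the 45 new residues. The constant names are hypothetical.

```python
# Values quoted in the text.
ATG_FIRST_NT = 94                # initiator ATG occupies positions 94-96
NEW_RESIDUES = 45                # aminoterminal residues not previously known
FULL_LENGTH_RESIDUES = 530
UPSTREAM_STOP_DISTANCE = 63      # nucleotides between upstream stop codon and ATG
REPORTED_EXTRA_5PRIME_NT = 228   # additional 5' nucleotides relative to hERbT

upstream_nt = ATG_FIRST_NT - 1                    # native sequence before the ATG
new_coding_nt = NEW_RESIDUES * 3                  # codons for residues 1-45
extra_5prime_nt = upstream_nt + new_coding_nt     # assumed composition of the 228 nt
stop_is_in_frame = UPSTREAM_STOP_DISTANCE % 3 == 0
coding_nt = FULL_LENGTH_RESIDUES * 3              # coding nucleotides, stop codon excluded

print(upstream_nt, new_coding_nt, extra_5prime_nt, stop_is_in_frame, coding_nt)
```

Under that assumption the numbers agree with one another: 93 upstream nucleotides plus 135 coding nucleotides give the reported 228, and a distance of 63 nucleotides is a whole number of codons, as expected for an in-frame stop.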

Functional differences between full-length hERβ and truncated hERβ

The experiments described below were performed to evaluate the
transcription activation properties of full-length hERβ according to the
present invention and to compare them with those of truncated hERβ. hERβL
and hERβT were expressed in parallel in different cell types and tested for
their ability to transactivate reporter genes containing estrogen response
elements (EREs). Alternatively, hERβL and hERβT may be expressed in host
cells containing endogenous estrogen-responsive genes and the
estrogen-mediated activation of the endogenous genes is measured.

(i) HepG2 Cells:

HepG2 cells (ATCC) were transfected in parallel with either
pcDNA3-hERβL or pcDNA3-hERβT using the calcium phosphate
co-precipitation method. Cells were co-transfected with a reporter plasmid
containing a luciferase gene preceded by either an ERE upstream of the
thymidine kinase (TK) basal promoter, or the TK basal promoter alone.
Cells also received a plasmid encoding β-galactosidase under the control of
an RSV promoter, which was used to correct for variation in DNA uptake.
Five hours after transfection, cells were incubated with or without 10⁻⁶ M
17-β estradiol for 20 hours, after which cell extracts were prepared.
Luciferase activity was measured by a chemiluminescent method using the
Promega luciferase assay system, and β-galactosidase activity was
measured by Galactolight (Tropix, Inc., Bedford MA); luciferase activity was
then normalized to β-galactosidase activity.

The results shown in Figure 6A indicate that, in the presence of
estradiol, hERβT caused a 2-fold stimulation of ERE activity. By contrast,
hERβL under the same conditions caused a 6-fold stimulation of ERE
activity. Thus, hERβL is about 3-fold more active than hERβT in this
circumstance.
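
The normalization and fold-stimulation calculation behind these figures can be illustrated as follows. This is a minimal sketch with hypothetical readings chosen to reproduce the reported 6-fold and 2-fold values; the function names and the raw numbers are assumptions, not data from the experiments.

```python
def normalized_activity(luciferase, beta_gal):
    """Luciferase signal corrected for DNA uptake via the beta-galactosidase control."""
    if beta_gal <= 0:
        raise ValueError("beta-galactosidase reading must be positive")
    return luciferase / beta_gal


def fold_stimulation(treated, untreated):
    """Ratio of normalized activity with estradiol to that without.

    Each argument is a (luciferase, beta_gal) pair.
    """
    return normalized_activity(*treated) / normalized_activity(*untreated)


# Hypothetical (luciferase, beta-galactosidase) readings.
herb_l_fold = fold_stimulation(treated=(12000.0, 100.0), untreated=(2000.0, 100.0))
herb_t_fold = fold_stimulation(treated=(4400.0, 110.0), untreated=(2000.0, 100.0))
relative_activity = herb_l_fold / herb_t_fold
print(herb_l_fold, herb_t_fold, relative_activity)
```

Dividing by the β-galactosidase reading before taking the ratio keeps a well with higher DNA uptake from being mistaken for one with stronger estrogen-dependent activation.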

In a separate experiment, HepG2 cells were transfected with hERβL
or hERβT as above, but the reporter gene consisted of three copies of an
NFκB binding site upstream of the TK basal promoter. Transfected cells
were incubated with or without interleukin-1β (IL-1β) to activate NFκB
and/or with estradiol prior to luciferase determination. The results shown in
Figure 7 indicate that hERβL was capable of attenuating the IL-1β-mediated
NFκB transcriptional activation (to an extent similar to that observed with
hERα), while hERβT exhibited no inhibitory activity.
(ii) Human endothelial cells:

HAECT-1 cells (a clonal immortalized human aortic endothelial cell
line derived by infection with Ad5 ori-SV40 ts A209) were transfected with
pcDNA3-hERβT or pcDNA3-hERβL and ERE-luciferase plasmids by
electroporation. After 4 hours, the cells were treated overnight with or
without 100 nM 17-β estradiol prior to luciferase activity measurements.
The results shown in Figure 8 indicate that hERβL is 2-3 times more active
than hERβT in activating the ERE-reporter gene in the presence of estradiol.
In independent experiments, cells transfected under identical conditions
were monitored for their levels of estrogen receptors using a ligand binding
assay. The results indicate that the increased activity of hERβL relative to
hERβT is not due to an increase in receptor number or stability, and, further,
that the 2-3 fold increment measured in the above experiment may be an
underestimate of the true transactivational capacity of hERβ.
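
The Figure 8 values are described as ERE.TK.LUC readings normalized to TK.LUC readings and reported as mean ± S.E. with n = 4. A minimal sketch of that summary statistic follows; the replicate readings are hypothetical, the pairing of replicates is an assumption, and the sample standard deviation (n - 1 denominator) is assumed for the standard error.

```python
from math import sqrt
from statistics import mean, stdev


def mean_and_se(values):
    """Return the mean and standard error (sample SD divided by sqrt(n))."""
    if len(values) < 2:
        raise ValueError("at least two replicates are needed for a standard error")
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical luciferase readings from four replicate transfections.
ere_tk_luc = [400.0, 420.0, 380.0, 440.0]
tk_luc = [100.0, 100.0, 100.0, 110.0]

# Normalize each ERE.TK.LUC reading to its paired TK.LUC reading.
ratios = [ere / tk for ere, tk in zip(ere_tk_luc, tk_luc)]
ratio_mean, ratio_se = mean_and_se(ratios)
print(ratio_mean, ratio_se)
```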
(iii) Yeast:

S. cerevisiae strain BJ2168 (Yeast Genetic Stock Center, Berkeley
CA) was co-transformed with an ERE-LacZ reporter plasmid (designated
YRpE2) and yeast vectors expressing either hERβL or hERβT under the
control of the yeast triose phosphate isomerase promoter in the yeast
pYX242 vector (R&D Systems, Minneapolis MN). Transformed cells were
grown in selective medium for 24 hours, after which they were treated in
the presence or absence of increasing concentrations of either 17-β
estradiol or the phytoestrogen genistein (Research Biochemical
International, Natick MA) for 3 hours prior to determination of
β-galactosidase activity. The dose-response results shown in Figure 9
indicate that the maximal level of estrogen-stimulated LacZ expression was
2-fold higher in hERβL-transformed cells relative to hERβT-transformed cells.
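
A dose-response comparison of this kind is often summarized with a sigmoidal model. The sketch below uses the Hill equation purely as an illustration; the text does not say that any curve was fitted, and every parameter value here (maximal responses, EC50, Hill coefficient) is hypothetical, chosen only so that the ratio of maximal responses is 2.

```python
def hill_response(dose, top, ec50, hill=1.0, bottom=0.0):
    """Response predicted by the Hill equation for a given ligand dose."""
    return bottom + (top - bottom) * dose**hill / (ec50**hill + dose**hill)


# Hypothetical maximal reporter activities (arbitrary units) and a shared EC50 (molar).
HERB_L_TOP = 200.0
HERB_T_TOP = 100.0
EC50 = 1e-9

doses = [1e-11, 1e-10, 1e-9, 1e-8, 1e-7]
herb_l_curve = [hill_response(d, HERB_L_TOP, EC50) for d in doses]
herb_t_curve = [hill_response(d, HERB_T_TOP, EC50) for d in doses]

# Ratio of the responses at the highest dose approximates the ratio of maxima.
max_ratio = herb_l_curve[-1] / herb_t_curve[-1]
print(herb_l_curve, herb_t_curve, max_ratio)
```

With a shared EC50 the two curves differ only in their plateau, which is the quantity the text compares.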

DNA, Vectors, and Expression Systems

Many conventional techniques in molecular biology, microbiology,
and recombinant DNA are used in practicing the present invention. See,
for example, Sambrook et al., 1989, Molecular Cloning: A Laboratory
Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, New York; DNA Cloning: A Practical Approach, Volumes I and II,
1985 (D.N. Glover ed.); Oligonucleotide Synthesis, 1984 (M.L. Gait ed.);
Nucleic Acid Hybridization, 1985 (Hames and Higgins eds.); Transcription and
Translation, 1984 (Hames and Higgins eds.); Animal Cell Culture, 1986 (R.I.
Freshney ed.); Immobilized Cells and Enzymes, 1986 (IRL Press); Perbal,
1984, A Practical Guide to Molecular Cloning; the series, Methods in
Enzymology (Academic Press, Inc.); Gene Transfer Vectors for Mammalian
Cells, 1987 (J. H. Miller and M. P. Calos eds., Cold Spring Harbor
Laboratory); and Methods in Enzymology Vol. 154 and Vol. 155 (Wu and
Grossman, and Wu, eds., respectively).

The present invention encompasses purified, isolated, nucleic acid
sequences encoding hERβ, including, e.g., the nucleotide sequence depicted
in Figure 3, SEQ ID NO:1 and subfragments derived therefrom, including
without limitation transcriptional activation-competent fragments. An
"isolated" or "purified" nucleic acid is a nucleic acid or polypeptide that is
removed from its original environment (for example, its natural environment
if it is naturally occurring). An isolated nucleic acid or polypeptide contains
less than about 50%, preferably less than about 75%, and most preferably
less than about 90%, of the cellular components with which it was
originally associated.

A nucleic acid that is "derived from" an hERβ sequence is a nucleic
acid sequence that corresponds to a region of the sequence, sequences
that are homologous or complementary to the sequence, and
"sequence-conservative variants" and "function-conservative variants".
Sequence-conservative variants are those in which a change of one or more
nucleotides in a given codon position results in no alteration in the amino
acid encoded at that position. Function-conservative variants are those in
which the amino acid sequence of hERβ has been changed without altering
the overall conformation and transcriptional activation function of the hERβ
polypeptide, including, but not limited to, replacement of an amino acid
with one having similar physico-chemical properties (such as, for example,
acidic, basic, hydrophobic, and the like). A large number of candidate
function-conservative hERβ variants, as well as fragments of hERβ that
retain transcriptional activation activity, can be prepared using routine
recombinant DNA manipulations as well as random or site-directed
mutagenesis techniques. Furthermore, hERβ-derived variants or fragments
that exhibit transcriptional activation activity can be identified using routine
experimentation by employing the methods described herein, e.g., by
co-expression with an appropriate reporter gene followed by measurement of
reporter gene transcription in the presence and absence of an estrogen.
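
The definition of a sequence-conservative variant lends itself to a direct check: two coding sequences of equal length that differ at the nucleotide level but translate to the same amino acids. The sketch below illustrates this with the first four codons of the hERβT 5' end shown earlier (ATG ACC GGC TAT, encoding M T G Y); the two variant sequences and all function names are hypothetical examples introduced here.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons of a DNA string, reading from position 0."""
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, len(dna) - 2, 3))


def is_sequence_conservative(original, variant):
    """True if the variant differs in nucleotides but encodes the same amino acids."""
    return (
        len(original) == len(variant)
        and original != variant
        and translate(original) == translate(variant)
    )


reference = "ATGACCGGCTAT"        # M T G Y
silent_variant = "ATGACAGGATAC"   # synonymous changes in three codons
changed_variant = "ATGACCGCCTAT"  # GGC -> GCC replaces G with A

print(is_sequence_conservative(reference, silent_variant))
print(is_sequence_conservative(reference, changed_variant))
```

Whether an amino acid change such as the one in the last example is function-conservative cannot be decided from sequence alone; as the text notes, that is determined experimentally with a reporter gene assay.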

In another embodiment, the present invention encompasses isolated,
purified, nucleic acids comprising nucleotides 94-229 of the sequence
depicted in Figure 3, SEQ ID NO:1, which encode amino acids 1-45 of
hERβ, and sequence-conservative variants thereof.

The nucleic acids of the present invention include purine- and
pyrimidine-containing polymers of any length, either polyribonucleotides or
polydeoxyribonucleotides or mixed polyribo-polydeoxyribo nucleotides. This
includes single- and double-stranded molecules, i.e., DNA-DNA, DNA-RNA
and RNA-RNA hybrids, as well as "protein nucleic acids" (PNA) formed by
conjugating bases to an amino acid backbone. This also includes nucleic
acids containing modified bases.

The nucleic acids may be isolated directly from cells. Alternatively,
PCR can be used to produce the nucleic acids of the invention, using either
chemically synthesized strands or genomic material as templates. Primers
used for PCR can be synthesized using the sequence information provided
herein and can further be designed to introduce appropriate new restriction
sites, if desirable, to facilitate incorporation into a given vector for
recombinant expression.

The nucleic acids of the present invention may be flanked by natural
regulatory sequences, or may be associated with heterologous sequences,
including promoters, enhancers, response elements, signal sequences,
polyadenylation sequences, introns, 5'- and 3'-noncoding regions, and the
like. The nucleic acids may also be modified by many means known in the
art. Non-limiting examples of such modifications include methylation,
"caps", substitution of one or more of the naturally occurring nucleotides
with an analog, internucleotide modifications such as, for example, those
with uncharged linkages (e.g., methyl phosphonates, phosphotriesters,
phosphoroamidates, carbamates, etc.) and with charged linkages (e.g.,
phosphorothioates, phosphorodithioates, etc.). Nucleic acids may contain
one or more additional covalently linked moieties, such as, for example,
proteins (e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine,
etc.), intercalators (e.g., acridine, psoralen, etc.), chelators (e.g., metals,
radioactive metals, iron, oxidative metals, etc.), and alkylators. PNAs are
also included. The nucleic acid may be derivatized by formation of a methyl
or ethyl phosphotriester or an alkyl phosphoramidate linkage. Furthermore,
the nucleic acid sequences of the present invention may also be modified
with a label capable of providing a detectable signal, either directly or
indirectly. Exemplary labels include radioisotopes, fluorescent molecules,
biotin, and the like.

The invention also provides nucleic acid vectors comprising
hERβ-encoding sequences or derivatives or fragments thereof. A large
number of vectors, including plasmid and fungal vectors, have been
described for replication and/or expression in a variety of eukaryotic and
prokaryotic hosts, and may be used for gene therapy as well as for simple
cloning or protein expression. The encoded hERβ-derived polypeptides may
be expressed by using many known vectors, such as pUC plasmids, pET
plasmids (Novagen, Inc., Madison, WI), or pRSET or pREP (Invitrogen, San
Diego, CA), and many appropriate host cells, using methods disclosed or
cited herein or otherwise known to those skilled in the relevant art. The
particular choice of vector/host is not critical to the practice of the
invention.

Recombinant cloning vectors will often include one or more
replication systems for cloning or expression, one or more markers for
selection in the host, e.g. antibiotic resistance, and one or more expression
cassettes. The inserted hERβ-encoding sequences may be synthesized by
standard methods, isolated from natural sources, or prepared as hybrids,
etc. Ligation of the hERβ-encoding sequences to transcriptional regulatory
elements and/or to other amino acid coding sequences may be achieved by
known methods. Suitable host cells may be
transformed/transfected/infected as appropriate by any suitable method
including electroporation, CaCl2-mediated DNA uptake, viral
vector-mediated DNA delivery, fungal infection, microinjection,
microprojectile, or other established methods.

Appropriate host cells include bacteria, archaebacteria, fungi,
especially yeast, and plant and animal cells, especially mammalian cells. Of
particular interest are E. coli, B. subtilis, Saccharomyces cerevisiae,
Saccharomyces carlsbergensis, Schizosaccharomyces pombe, SF9 cells,
C129 cells, 293 cells, Neurospora, and HepG2 cells, CHO cells, COS cells,
HeLa cells, and immortalized mammalian myeloid and lymphoid cell lines.
Preferred replication systems include M13, ColE1, SV40, baculovirus,
lambda, adenovirus, and the like. A large number of transcription initiation
and termination regulatory regions have been isolated and shown to be
effective in the transcription and translation of heterologous proteins in the
various hosts. Examples of these regions, methods of isolation, manner of
manipulation, etc. are known in the art. Under appropriate expression
conditions, host cells can be used as a source of recombinantly produced
hERβ-derived peptides and polypeptides.

Advantageously, vectors may also include a transcription regulatory
element (i.e., a promoter) operably linked to the hERβ portion. The
promoter may optionally contain operator portions and/or ribosome
binding sites. Non-limiting examples of bacterial promoters compatible
with E. coli include: β-lactamase (penicillinase) promoter; lactose
promoter; tryptophan (trp) promoter; arabinose BAD operon promoter;
lambda-derived PL promoter and N gene ribosome binding site; and the
hybrid tac promoter derived from sequences of the trp and lac UV5
promoters. Non-limiting examples of yeast promoters include triose
phosphate isomerase promoter, 3-phosphoglycerate kinase promoter,
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter,
galactokinase (GAL1) promoter, galactoepimerase promoter, and alcohol
dehydrogenase (ADH) promoter. Suitable promoters for mammalian cells
include without limitation viral promoters such as those from Simian
Virus 40 (SV40), Rous sarcoma virus (RSV), adenovirus (ADV), and bovine
papilloma virus (BPV). Mammalian cells may also require terminator
sequences and poly A addition sequences; enhancer sequences, which
increase expression, may also be included, and sequences which cause
amplification of the gene may also be desirable. Furthermore, sequences
that facilitate secretion of the recombinant product from cells,
including, but not limited to, bacteria, yeast, and animal cells, such
as secretory signal sequences and/or prohormone pro region sequences,
may also be included. These sequences are well described in the art.

Nucleic acids encoding hERβ-derived polypeptides may also be
introduced into cells by recombination events. For example, such a
sequence can be introduced into a cell, and thereby effect homologous
recombination at the site of an endogenous gene or a sequence with
substantial identity to the gene. Other recombination-based methods
such as nonhomologous recombinations or deletion of endogenous genes by
homologous recombination may also be used.

The nucleic acids of the present invention find use as templates for
the recombinant production of hERβ-derived peptides or polypeptides.

hERβ-derived polypeptides

The present invention encompasses purified hERβ-derived
polypeptides comprising amino acids 1-45 of hERβ and further comprising
all or part of the amino acid sequence depicted in Figure 4, SEQ ID
NO:2, and function-conservative variants thereof, i.e., variants that
exhibit estrogen-induced transcriptional activation activity. Also
encompassed by the invention are peptides comprising amino acids 1-45
of SEQ ID NO:2 and function-conservative variants thereof.

Nucleic acids comprising hERβ-coding sequences can be used to
direct the expression of hERβ-derived polypeptides in intact cells or
in cell-free translation systems. The known genetic code, tailored if
desired for more efficient expression in a given host organism, can be
used to synthesize oligonucleotides encoding the desired amino acid
sequences. The phosphoramidite solid support method of Matteucci et
al., 1981, J. Am. Chem. Soc. 103:3185, the method of Yoo et al., 1989,
J. Biol. Chem. 264:17078, or other well known methods can be used for
such synthesis. The resulting oligonucleotides can be inserted into an
appropriate vector and expressed in a compatible host organism.
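As an illustration of tailoring the genetic code to a host, the following minimal Python sketch back-translates a short peptide into an oligonucleotide and confirms the result by translating it again. The standard genetic code is used; the preference table `EXAMPLE_PREFERENCE` and the function names are hypothetical and do not represent measured codon usage for any host.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: DNA codon -> one-letter amino acid ('*' = stop).
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a coding sequence codon by codon (no stop handling)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def back_translate(protein, preferred=None):
    """Pick one codon per residue, using host preferences where supplied."""
    preferred = preferred or {}
    default = {}
    for codon, aa in sorted(CODON_TABLE.items()):
        default.setdefault(aa, codon)  # alphabetically first codon as fallback
    codons = []
    for aa in protein:
        codon = preferred.get(aa, default[aa])
        if CODON_TABLE[codon] != aa:
            raise ValueError(f"{codon} does not encode {aa}")
        codons.append(codon)
    return "".join(codons)


# Hypothetical preference table for illustration; not measured codon usage.
EXAMPLE_PREFERENCE = {"L": "CTG", "S": "AGC", "K": "AAA"}

peptide = "MDIKNSPSS"  # first nine residues of SEQ ID NO:2
oligo = back_translate(peptide, EXAMPLE_PREFERENCE)
```

Because several codons encode most residues, many different oligonucleotides encode the same peptide; the preference table only selects among them.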

The polypeptides of the present invention, including
function-conservative variants of the disclosed hERβ sequences, may be
isolated from wild-type or mutant human cells, or from heterologous
organisms or cells (including, but not limited to, bacteria, fungi,
insect, plant, and mammalian cells) into which an hERβ-derived
protein-coding sequence has been introduced and expressed. Furthermore,
the polypeptides may be part of recombinant fusion proteins.

Polypeptides may be chemically synthesized by commercially
available automated procedures, including, without limitation,
exclusive solid phase synthesis, partial solid phase methods, fragment
condensation or classical solution synthesis. The polypeptides are
preferably prepared by solid phase peptide synthesis as described by
Merrifield, 1963, J. Am. Chem. Soc. 85:2149.

"Isolation" or "purification" of an hERβ-derived polypeptide refers to
the isolation of the polypeptide in a form that allows its
transcriptional activation activity to be measured without interference
by other components of the cell in which the polypeptide is expressed.
Methods for polypeptide purification are well-known in the art,
including, without limitation, preparative disc-gel electrophoresis,
isoelectric focusing, HPLC, reversed-phase HPLC, gel filtration, ion
exchange and partition chromatography, and countercurrent distribution.
For some purposes, it is preferable to produce the polypeptide in a
recombinant system in which the hERβ-derived protein contains an
additional sequence tag that facilitates purification, such as, but not
limited to, a polyhistidine sequence. The polypeptide can then be
purified from a crude lysate of the host cell by chromatography on an
appropriate solid-phase matrix. Alternatively, antibodies produced
against an hERβ-derived protein or against peptides derived therefrom
can be used as purification reagents. Other purification methods are
possible.

The present invention also encompasses derivatives and homologues
of hERβ-encoded polypeptides. For some purposes, nucleic acid sequences
encoding the peptides may be altered by substitutions, additions, or
deletions that provide for functionally equivalent molecules, i.e.,
function-conservative variants. For example, one or more amino acid
residues within the sequence can be substituted by another amino acid
of similar properties, such as, for example, positively charged amino
acids (arginine, lysine, and histidine); negatively charged amino acids
(aspartate and glutamate); polar neutral amino acids; and non-polar
amino acids.
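The grouping of residues by similar properties can be sketched as a simple lookup. In the Python sketch below, the charged classes follow the text; the membership of the polar neutral and non-polar classes is an assumed, commonly used grouping (the text does not list it), and all names are hypothetical.

```python
# Charged classes follow the text. The polar neutral and non-polar
# memberships are an ASSUMED grouping; other conventions exist.
CLASSES = {
    "positive": set("RKH"),
    "negative": set("DE"),
    "polar_neutral": set("STNQCYG"),
    "non_polar": set("AVLIMFWP"),
}


def residue_class(aa):
    """Return the property class of a one-letter amino acid code."""
    for name, members in CLASSES.items():
        if aa in members:
            return name
    raise ValueError(f"unknown residue: {aa}")


def is_conservative(a, b):
    """True if two residues fall in the same property class."""
    return residue_class(a) == residue_class(b)


def nonconservative_positions(reference, variant):
    """1-based positions where a substitution changes the property class."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    return [
        i
        for i, (a, b) in enumerate(zip(reference, variant), start=1)
        if not is_conservative(a, b)
    ]
```

A class match is only a first screen; whether a variant is function-conservative is decided by its estrogen-induced transcriptional activation activity.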

The isolated polypeptides may be modified by, for example,
phosphorylation, sulfation, acylation, or other protein modifications.
They may also be modified with a label capable of providing a
detectable signal, either directly or indirectly, including, but not
limited to, radioisotopes and fluorescent compounds.

hERβ-specific Antibodies
The present invention encompasses antibodies that specifically
recognize hERβ-derived peptides and polypeptides, including without
limitation antibodies that recognize hERβ but not, e.g., hERα, and
those that recognize hERβL but not hERβT. Such hERβ-specific antibodies
can be used conventionally, e.g., as diagnostic reagents or as reagents
for purification of hERβ-derived polypeptides. Other uses include
immunocytochemical localization of hERβ; gel shift assays; and
"pull-down" experiments to identify protein co-activators associated
with hERβ.

hERβ-specific antibodies according to the present invention include
polyclonal and monoclonal antibodies. The antibodies may be elicited in
an animal host by immunization with hERβ immunogenic components or may
be formed by in vitro immunization (sensitization) of immune cells. The
immunogenic components used to elicit the production of antibodies may
be isolated from any cell source or may be chemically synthesized. The
antibodies may also be produced in recombinant systems programmed with
appropriate antibody-encoding DNA. Alternatively, the antibodies may be
constructed by biochemical reconstitution of purified heavy and light
chains. The antibodies include hybrid antibodies (i.e., containing two
sets of heavy chain/light chain combinations, each of which recognizes
a different antigen), chimeric antibodies (i.e., in which either the
heavy chains, light chains, or both, are fusion proteins), and
univalent antibodies (i.e., comprised of a heavy chain/light chain
complex bound to the constant region of a second heavy chain). Also
included are Fab fragments, including Fab' and F(ab)2 fragments of
antibodies. Methods for the production of all of the above types of
antibodies and derivatives are well-known in the art. For example,
techniques for producing and processing polyclonal antisera are
disclosed in Mayer and Walker, 1987, Immunochemical Methods in Cell and
Molecular Biology, (Academic Press, London). The general methodology
for making monoclonal antibodies by hybridomas is well known. Immortal
antibody-producing cell lines can be created by cell fusion, and also
by other techniques such as direct transformation of B lymphocytes with
oncogenic DNA, or transfection with Epstein-Barr virus. See, e.g.,
Schreier et al., 1980, Hybridoma Techniques. Panels of monoclonal
antibodies produced against hERβ epitopes can be screened for various
properties; i.e., for isotype, epitope affinity, etc.

Antibodies against hERβ-derived immunogenic components can be
used, unlabeled or labeled by standard methods, as the basis for
immunoassays. The particular label used will depend upon the type of
immunoassay used. Examples of labels that can be used include but are
not limited to radiolabels such as 32P, 125I, 3H and 14C; fluorescent
labels such as fluorescein and its derivatives, rhodamine and its
derivatives, dansyl and umbelliferone; chemiluminescers such as
luciferin and 2,3-dihydrophthalazinediones; and enzymes such as
horseradish peroxidase, alkaline phosphatase, lysozyme and
glucose-6-phosphate dehydrogenase.

Applications

The methods and compositions of the present invention can be used
to identify compounds that interact with hERβ, either to activate or to
inhibit its transcriptional activation function. Such compounds
include, without limitation, co-activator proteins, as well as
estrogens and other steroids, steroid-like molecules, or
non-steroid-like molecules that act as agonists or antagonists.
Screening methods can also be used to identify tissue-specific
estrogens.

Identification of hERβ-interactive compounds can be achieved by
cell-free or cell-based assays. In one set of embodiments, purified
hERβ is contacted with a labelled ligand, such as, e.g., 17-β
estradiol, in the presence of test compounds to form test reactions,
and in the absence of test compounds to form control reactions. The
labelled moiety may comprise a radiolabel (such as, e.g., 3H or 125I)
or a fluorescent molecule. Incubation is allowed to proceed for a
sufficient time and under appropriate conditions to achieve specific
binding, after which binding of labelled estradiol to hERβ is measured
(by monitoring, e.g., radioactivity, fluorescence, or fluorescence
polarization). In one embodiment, hERβ produced in E. coli (as
described in Example 1 below) is adsorbed to the wells of a microtiter
dish and incubated with [3H]-17β estradiol in the absence or presence
of test compounds (see, e.g., Example 2 below). Alternatively, soluble
receptor is incubated with the labelled ligand in the absence or
presence of test compounds, and bound ligand is separated from free
ligand, either by filtration on glass fiber filters or by using
dextran-coated charcoal. See, e.g., Hulme, ed., Receptor-ligand
Interactions: A Practical Approach, IRL Press, NY, 1992.

Whole cell binding assays may also be used in which bound ligand is
separated from free ligand by rinsing. Cells used in these assays may
either contain endogenous receptor, or may overexpress the receptor
subsequent to stable or transient transfection or infection of an hERβ
gene or cDNA. Non-limiting examples of suitable cells include COS
cells, HeLa cells, CHO cells, human umbilical vein endothelial cells
(HUVEC), and yeast. Once a compound has been identified as an
hERβ-interactive compound by its binding activity, further in vivo and
in vitro tests may be performed to determine the nature and extent of
activity, i.e., as an agonist or antagonist (see below).

hERβ-interactive compounds may also be identified using cell-based
assays that measure transcriptional activation or suppression of
endogenous or transfected estrogen-responsive genes. For example,
agonists (such as, e.g., 17β-estradiol) block interleukin-1β induction
of endogenous E-selectin in primary human umbilical vein endothelial
cells (HUVEC) that express hERβ. Antagonists (such as, e.g.,
ICI-182780) block the agonist activity of 17β-estradiol. Non-limiting
examples of other suitable endogenous estrogen-responsive promoter
elements include those that regulate endothelin-1 (ET-1); HDL receptor
(scavenger receptor type II); and enzymes involved in coagulation and
fibrinolysis (such as, e.g., plasminogen activator inhibitor-1 and
complement C3). Any promoter element that responds to estrogen may be
used as an appropriate target, including, e.g., the NFκB binding site
or the apolipoprotein A1 gene enhancer sequence.
In one set of embodiments, appropriate host cells are transfected
with an expression vector encoding hERβ and the transfectants are
incubated with or without estradiol in the presence or absence of test
compounds. hERβ activity is assessed by measuring transcriptional
activation of the target sequence. This may be achieved by detection of
mRNA (using, e.g., Northern blot analysis) and/or by detection of the
protein (using, e.g., immunoassays or functional assays). If activation
of the target sequence initiates a biochemical cascade, downstream
biological events may also be measured to quantify hERβ activity.
hERβ-interactive compounds are identified as those that positively or
negatively influence target sequence activation.

In another set of embodiments, appropriate host cells (preferably,
bacterial or yeast cells) are co-transfected with an expression vector
encoding hERβ and a reporter plasmid containing a reporter gene
downstream of one or more estrogen response elements (EREs).
Transfected cells are incubated with or without estradiol in the
presence or absence of test compounds, after which hERβ activity is
determined by measuring expression of the reporter gene. In a preferred
embodiment, hERβ activity is monitored visually. Non-limiting examples
of suitable reporter genes include luciferase, chloramphenicol acetyl
transferase (CAT), and green fluorescent protein.
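One way the four incubation conditions of such a reporter assay (with or without estradiol, with or without test compound) might be reduced to a call is sketched below in Python. The two-fold `threshold`, the function names, and the three labels are hypothetical choices made for illustration; the text does not specify a decision rule.

```python
def fold_change(signal, reference):
    """Ratio of a reporter signal to a reference signal."""
    if reference <= 0:
        raise ValueError("reference signal must be positive")
    return signal / reference


def classify(vehicle, estradiol, compound, compound_plus_estradiol,
             threshold=2.0):
    """Label a test compound from four reporter readings.

    threshold is a hypothetical cut-off, not a value from the text.
    """
    # Compound alone raises reporter expression: agonist-like.
    if fold_change(compound, vehicle) >= threshold:
        return "agonist"
    # Compound lowers the estradiol response: antagonist-like.
    estradiol_responds = fold_change(estradiol, vehicle) >= threshold
    blocked = fold_change(estradiol, compound_plus_estradiol) >= threshold
    if estradiol_responds and blocked:
        return "antagonist"
    return "inactive"
```

In practice the cut-off would be set from replicate variability, and partial agonists would need a finer scale than three labels.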

Preferably, the methods of the present invention are adapted to a
high-throughput screen, allowing a multiplicity of compounds to be
tested in a single assay. Candidate estrogens and estrogen-like
compounds include without limitation diethylstilbestrol, genistein, and
estrone. Other hERβ-interactive compounds may be found in, for example,
natural product libraries, fermentation libraries (encompassing plants
and microorganisms), combinatorial libraries, compound files, and
synthetic compound libraries. For example, synthetic compound libraries
are commercially available from Maybridge Chemical Co. (Trevillet,
Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack,
NH), and Microsource (New Milford, CT). A rare chemical library is
available from Aldrich Chemical Company, Inc. (Milwaukee, WI).
Alternatively, libraries of natural compounds in the form of bacterial,
fungal, plant and animal extracts are available from, for example, Pan
Laboratories (Bothell, WA) or MycoSearch (NC), or are readily
producible. Additionally, natural and synthetically produced libraries
and compounds are readily modified through conventional chemical,
physical, and biochemical means (Blondelle et al., TibTech 14:60,
1996). hERβ binding assays according to the present invention are
advantageous in accommodating many different types of solvents and thus
allowing the testing of compounds from many sources.

Compounds identified as hERβ agonists or antagonists using the
methods of the present invention may be modified to enhance potency,
efficacy, uptake, stability, and suitability for use in therapeutic
applications, etc. These modifications are achieved and tested using
methods well-known in the art.



Description of the Preferred Embodiments

The following examples are intended to illustrate the present 
invention without limitation. 



Example 1: High-level Expression of Human hERβ in E. coli

Human hERβ according to the present invention is overexpressed in
E. coli strain BL21(DE3) using, for example, the pET15B vector. A 10-ml
overnight culture is used to inoculate 1 liter LB medium containing 100
µg/ml ampicillin. Cultures are grown at 37°C and then induced by the
addition of 1 mM IPTG. After an additional incubation for 2 h at 25°C,
cells are harvested by centrifugation at 10,000 x g for 30 minutes and
resuspended in 100 ml of a buffer containing 50 mM Tris-HCl, pH 7.4,
and 150 mM NaCl. Cells are lysed using a French press, and insoluble
material is pelleted by centrifugation. The supernatant solution is
recovered and stored at -70°C.
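The reagent quantities implied by these concentrations follow from mass = concentration x volume x molar mass. The Python sketch below works this out for the 100 ml of resuspension buffer and for IPTG in a 1-liter culture. Preparing the buffer from Tris base titrated to pH 7.4 with HCl is an assumption (the text says only "Tris-HCl"), and the function and variable names are hypothetical.

```python
# Molar masses in g/mol. Using Tris base (titrated to pH 7.4 with HCl) is an
# assumption; the text only says "Tris-HCl".
MOLAR_MASS = {"Tris base": 121.14, "NaCl": 58.44, "IPTG": 238.30}


def grams_needed(compound, millimolar, volume_l):
    """Mass in grams for a target mM concentration in a given volume."""
    return millimolar / 1000.0 * volume_l * MOLAR_MASS[compound]


# 100 ml of resuspension buffer: 50 mM Tris, 150 mM NaCl.
lysis_buffer = {
    "Tris base": grams_needed("Tris base", 50, 0.1),
    "NaCl": grams_needed("NaCl", 150, 0.1),
}

# 1 mM IPTG in a 1-liter culture.
iptg_for_culture = grams_needed("IPTG", 1, 1.0)
```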



Example 2: Estrogen Receptor-β Ligand Binding Assay

For determining the ability of a particular compound to bind hERβ,
100 µl of the receptor preparation described in Example 1 above,
diluted in assay buffer (Dulbecco's phosphate buffered saline (Gibco
#14200-075) supplemented with 1 mM EDTA), is added to each well of a
high-binding masked microtiter plate (Wallac #1450-511, Gaithersburg,
MD). 10 µl of test compound (or vehicle) and 10 µl of
[3H]-17β-estradiol are added to each well, and the plate is incubated
at room temperature for 4-6 hours. Unbound material is aspirated, and
the plate is washed three times with 300 µl of assay buffer. Then, 150
µl of scintillation cocktail (Optiphase Supermix, Wallac #1200-439) is
added per well, and the plate is sealed and agitated for at least 5
min. Bound radioactivity is measured by scintillation counting.

Test compounds are initially tested at a concentration of 1.5 mg/ml
(approximately 5 mM for a compound having a molecular mass of 300).
Positive compounds are then re-tested at a number of different
concentrations to determine the IC50.
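The conversion in parentheses, and a re-test series, can be sketched in Python as follows. The units are taken as printed; the ten-fold dilution factor and the five-point series are hypothetical, since the text says only "a number of different concentrations".

```python
def mass_to_molar_mM(mg_per_ml, molecular_mass):
    """Convert mg/ml (equal to g/l) to mmol/l for a given g/mol."""
    return mg_per_ml / molecular_mass * 1000.0


def dilution_series(top, factor, points):
    """Concentrations from `top` downward, each `factor`-fold lower."""
    return [top / factor ** i for i in range(points)]


screen_mM = mass_to_molar_mM(1.5, 300)       # the screening concentration
retest_mM = dilution_series(screen_mM, 10, 5)  # hypothetical re-test series
```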

Data are expressed as percent inhibition of specific binding.
Exploratory data analysis (EDA) is performed on raw data to check for
non-normality and non-homogeneity of variance. The maximum likelihood
Box-Cox transformation, which maximizes the normality, homogeneity of
variance, and goodness of fit of the data, is then obtained. Based on
the result, the appropriate transformation of the data (no
transformation, square root transformation, or logarithmic
transformation) is used for model fitting. The Huber M-estimator is
used to down-weight any outlying transformed observations for analysis
of variance and dose-response curve fitting.
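The three ingredients of this analysis can be sketched in Python as below. This is a minimal illustration, not the analysis actually run: the non-specific binding reading (`nonspecific_cpm`, e.g. wells with excess unlabelled ligand) is an assumed control that the text does not describe, the tuning constant `k=1.345` is a conventional default rather than a value from the text, and the maximum likelihood choice of the Box-Cox exponent is not shown.

```python
import math


def percent_inhibition(test_cpm, total_cpm, nonspecific_cpm):
    """Percent inhibition of specific binding.

    nonspecific_cpm is an assumed control reading; specific binding is
    taken as the reading minus this value.
    """
    specific_control = total_cpm - nonspecific_cpm
    specific_test = test_cpm - nonspecific_cpm
    return 100.0 * (1.0 - specific_test / specific_control)


def box_cox(y, lam):
    """Box-Cox transform; lam = 1, 0.5, 0 correspond to no transformation,
    square root, and logarithm (up to shift and scale)."""
    if y <= 0:
        raise ValueError("Box-Cox requires positive data")
    if lam == 0:
        return math.log(y)
    return (y ** lam - 1.0) / lam


def huber_weight(residual, scale, k=1.345):
    """Huber weight: 1 inside k scaled units, shrinking as k/|u| outside."""
    u = abs(residual) / scale
    return 1.0 if u <= k else k / u
```

Observations with small residuals keep full weight, while outliers contribute in proportion to `k/|u|`, which is what limits their pull on the fitted curve.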

For ANOVA, multiple comparisons LSD p-values are computed.
Re-transformed summary statistics (mean, s.d., s.e.m.) are obtained for
each treatment group.

For dose-response curve fitting, a four-parameter logistic model is
fit to the transformed, weighted data. The four parameters are min,
max, slope, and ED50, where ED50 is defined as the dose which
corresponds to midway between the estimated max and min. All of the
parameters and confidence intervals are re-transformed back to the
original units of the data. A further transformation into percent
inhibition (using estimated min and max) is performed.
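A four-parameter logistic curve of this kind can be written down directly. The Python sketch below uses one common parameterization, assumed here because the text does not give the formula, in which the response falls from max to min as dose rises; the counts used in the usage note are hypothetical.

```python
def four_pl(dose, lo, hi, slope, ed50):
    """Four-parameter logistic response (assumed parameterization).

    The response falls from hi (dose -> 0) to lo (dose -> infinity) and
    equals the midpoint of lo and hi when dose == ed50.
    """
    return lo + (hi - lo) / (1.0 + (dose / ed50) ** slope)


def to_percent_inhibition(response, lo, hi):
    """Rescale a fitted response to percent inhibition using min and max."""
    return 100.0 * (hi - response) / (hi - lo)
```

For example, with hypothetical bounds of 100 and 1100 counts, a slope of 1, and an ED50 of 6.7 nM, `four_pl(6.7, 100, 1100, 1.0, 6.7)` gives the midpoint response, which `to_percent_inhibition` maps to 50 percent.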

Using this assay, the following values were obtained for reference
compounds:

Compound             IC50      95% confidence limits
17β-estradiol        6.7 nM    6 - 7.5 nM
diethylstilbestrol   21 nM     14 - 31 nM
genistein            1.6 nM    1.4 - 1.8 nM



All patents, applications, articles, publications, and test methods 
mentioned above are hereby incorporated by reference. 

Many variations of the present invention will suggest themselves to 
those skilled in the art in light of the above detailed description. Such 
obvious variations are within the full intended scope of the appended 
claims.

Claims:

1. An isolated nucleic acid comprising the sequence
depicted in Figure 1, SEQ ID NO:1.

2. A nucleic acid as defined in claim 1, wherein said
nucleic acid is DNA.

3. A nucleic acid as defined in claim 1, wherein said 
nucleic acid is RNA. 

4. A recombinant DNA vector comprising a sequence as
defined in claim 1.

5. A recombinant DNA vector comprising a sequence as 
defined in claim 1 operably linked to a transcription regulatory element. 

6. A cell comprising a DNA vector as defined in claim 5,
wherein said cell is selected from the group consisting of bacterial, fungal,
plant, insect, and mammalian cells.

7. A method for producing a polypeptide, said method 
comprising incubating a cell as defined in claim 6 under conditions that 
permit expression of one or more polypeptides encoded by said nucleic 
acid. 

8. A method as defined in claim 7, further comprising: 

(a) harvesting said incubated cells to produce a 
cell fraction and a medium fraction; and 

(b) recovering said one or more polypeptides from 
said cell fraction, said medium fraction, or both. 

9. A purified isolated nucleic acid encoding the amino 
acid sequence depicted in Figure 1 SEQ ID NO:2. 

10. A nucleic acid as defined in claim 9, wherein said 
nucleic acid is DNA. 

11. A nucleic acid as defined in claim 9, wherein said 
nucleic acid is RNA. 

12. A recombinant DNA vector comprising a sequence as
defined in claim 9.

13. A recombinant DNA vector comprising a sequence as
defined in claim 9 operably linked to a transcription regulatory element.

14. A cell comprising a DNA vector as defined in claim
13, wherein said cell is selected from the group consisting of bacterial,
fungal, plant, insect, and mammalian cells.

15. A method for producing a polypeptide, said method 
comprising incubating a cell as defined in claim 14 under conditions that 
permit expression of one or more polypeptides encoded by said nucleic 
acid. 

16. A method as defined in claim 15, further comprising:

(a) harvesting said incubated cells to produce a 
cell fraction and a medium fraction; and 

(b) recovering said one or more polypeptides from 
said cell fraction, said medium fraction, or both. 

17. A purified polypeptide comprising a sequence selected 
from the group consisting of the sequence depicted in Figure 1 SEQ ID 
NO:2 and function-conservative variants thereof. 

18. A purified polypeptide comprising amino acids 1-45 of 
the sequence depicted in Figure 1 SEQ ID NO:2. 

19. A method for identifying hERβ-interactive compounds,
said method comprising:

(a) contacting purified hERβ with a labelled ligand
in the presence of test compounds, to form test reactions, and in the
absence of test compounds, to form control reactions;

(b) incubating said test and control reactions
under appropriate conditions to achieve equilibrium binding of said labelled
ligand to hERβ;

(c) determining the level of binding of said labelled
ligand to hERβ in said test and control cultures; and

(d) identifying as a hERβ-interactive compound
any compound that reduces the binding of said labelled ligand to hERβ.

20. A method as defined in claim 19, wherein said ligand
is 17-β estradiol.

21. A method as defined in claim 19, wherein said
hERβ-interactive compound is an agonist.

22. A method as defined in claim 19, wherein said
hERβ-interactive compound is an antagonist.

23. An antibody that specifically recognizes hERβ.

CAGCCATTATACTTGCCCACGAATCTTTGAGAACATTATAATGACCTTTGTGCCTCTTCT 60

TGCAAGGTGTTTTCTCAGCTGCTATCTCAAGACATGGATATAAAAAACTCACCATCTAGC 120

CTTAATTCTCCTTCCTCCTACAACTGCAGTCAATCCATCTTACCCCTGGAGCACGGCTCC 180 

ATATACATACCTTCCTCCTATGTAGACAGCCACCATGAATATCCAGCCATGACATTCTAT 240

AGCCCTGCTGTGATGAATTACAGCATTCCCAGCAATGTCACTAACTTGGAAGGTGGGCCT 300 

GGTCGGCAGACCACAAGCCCAAATGTGTTGTGGCCAACACCTGGGCACCTTTCTCCTTTA 360

GTGGTCCATCGCCAGTTATCACATCTGTATGCGGAACCTCAAAAGAGTCCCTGGTGTGAA 420 

GCAAGATCGCTAGAACACACCTTACCTGTAAACAGAGAGACACTGAAAAGGAAGGTTAGT 480 

GGGAACCGTTGCGCCAGCCCTGTTACTGGTCCAGGTTCAAAGAGGGATGCTCACTTCTGC 540

GCTGTCTGCAGCGATTACGCATCGGGATATCACTATGGAGTCTGGTCGTGTGAAGGATGT 600 

AAGGCCTTTTTTAAAAGAAGCATTCAAGGACATAATGATTATATTTGTCCAGCTACAAAT 660

CAGTGTACAATCGATAAAAACCGGCGCAAGAGCTGCCAGGCCTGCCGACTTCGGAAGTGT 720 

TACGAAGTGGGAATGGTGAAGTGTGGCTCCCGGAGAGAGAGATGTGGGTACCGCCTTGTG 780 

CGGAGACAGAGAAGTGCCGACGAGCAGCTGCACTGTGCCGGCAAGGCCAAGAGAAGTGGC 840 

GGCCACGCGCCCCGAGTGCGGGAGCTGCTGCTGGACGCCCTGAGCCCCGAGCAGCTAGTG 900 

CTCACCCTCCTGGAGGCTGAGCCGCCCCATGTGCTGATCAGCCGCCCCAGTGCGCCCTTC 960 

ACCGAGGCCTCCATGATGATGTCCCTGACCAAGTTGGCCGACAAGGAGTTGGTACACATG 1020 

ATCAGCTGGGCCAAGAAGATTCCCGGCTTTGTGGAGCTCAGCCTGTTCGACCAAGTGCGG 1080 

CTCTTGGAGAGCTGTTGGATGGAGGTGTTAATGATGGGGCTGATGTGGCGCTCAATTGAC 1140 

CACCCCGGCAAGCTCATCTTTGCTCCAGATCTTGTTCTGGACAGGGATGAGGGGAAATGC 1200

GTAGAAGGAATTCTGGAAATCTTTGACATGCTCCTGGCAACTACTTCAAGGTTTCGAGAG 1260

TTAAAACTCCAACACAAAGAATATCTCTGTGTCAAGGCCATGATCCTGCTCAATTCCAGT 1320 

ATGTACCCTCTGGTCACAGCGACCCAGGATGCTGACAGCAGCCGGAAGCTGGCTCACTTG 1380 

CTGAACGCCGTGACCGATGCTTTGGTTTGGGTGATTGCCAAGAGCGGCATCTCCTCCCAG 1440

CAGCAATCCATGCGCCTGGCTAACCTCCTGATGCTCCTGTCCCACGTCAGGCATGCGAGT 1500 

AACAAGGGCATGGAACATCTGCTCAACATGAAGTGCAAAAATGTGGTCCCAGTGTATGAC 1560 

CTGCTGCTGGAGATGCTGAATGCCCACGTGCTTCGCGGGTGCAAGTCCTCCATCACGGGG 1620 

TCCGAGTGCAGCCCGGCAGAGGACAGTAAAAGCAAAGAGGGCTCCCAGAACCCACAGTCT 1680 

CAGTGA 1686

FIG. 3

MDIKNSPSSL NSPSSYNCSQ SILPLEHGSI YIPSSYVDSH HEYPAMTFYS 50

PAVMNYSIPS NVTNLEGGPG RQTTSPNVLW PTPGHLSPLV VHRQLSHLYA 100 

EPQKSPWCEA RSLEHTLPVN RETLKRKVSG NRCASPVTGP GSKRDAHFCA 150 

VCSDYASGYH YGVWSCEGCK AFFKRSIQGH . NDYICPATNQ CTIDKNRRKS 200 

CQACRLRKCY EVGMVKCGSR RERCGYRLVR RQRSADEQLH CAGKAKRSGG 250 

HAPRVRELLL DALSPEQLVL TLLEAEPPHV LISRPSAPFT EASMMMSLTK 300 

LADKELVHMI SWAKKIPGFV ELSLFDQVRL LESCWMEVLM MGLMWRSIDH 350 

PGKLIFAPDL VLDRDEGKCV EGILEIFDML LATTSRFREL KLQHKEYLCV 400 

KAMILLNSSM YPLVTATQDA DSSRKLAHLL NAVTDALVWV IAKSGISSQQ 450 

QSMRLANLLM LLSHVRHASN KGMEHLLNMK CKNVVPVYDL LLEMLNAHVL 500 

RGCKSSITGS ECSPAEDSKS KEGSQNPQSQ 530



FIG. 4

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: American Home Products Corporation 

(B) STREET: Five Giralda Farms 

(C) CITY: Madison 

(D) STATE: New Jersey 

(E) COUNTRY: U.S.A. 

(F) POSTAL CODE (ZIP): 07940 

(G) TELEPHONE: (973) 683-2169 

(H) TELEFAX: (973) 683-4117 

(ii) TITLE OF INVENTION: Novel Human Estrogen Receptor-B 
(iii) NUMBER OF SEQUENCES: 13 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CAGCCATTAT ACTTGCCCAC GAATCTTTGA GAACATTATA ATGACCTTTG TGCCTCTTCT 60
TGCAAGGTGT TTTCTCAGCT GCTATCTCAA GACATGGATA TAAAAAACTC ACCATCTAGC 120
CTTAATTCTC CTTCCTCCTA CAACTGCAGT CAATCCATCT TACCCCTGGA GCACGGCTCC 180
ATATACATAC CTTCCTCCTA TGTAGACAGC CACCATGAAT ATCCAGCCAT GACATTCTAT 240
AGCCCTGCTG TGATGAATTA CAGCATTCCC AGCAATGTCA CTAACTTGGA AGGTGGGCCT 300
GGTCGGCAGA CCACAAGCCC AAATGTGTTG TGGCCAACAC CTGGGCACCT TTCTCCTTTA 360
GTGGTCCATC GCCAGTTATC ACATCTGTAT GCGGAACCTC AAAAGAGTCC CTGGTGTGAA 420
GCAAGATCGC TAGAACACAC CTTACCTGTA AACAGAGAGA CACTGAAAAG GAAGGTTAGT 480
GGGAACCGTT GCGCCAGCCC TGTTACTGGT CCAGGTTCAA AGAGGGATGC TCACTTCTGC 540
GCTGTCTGCA GCGATTACGC ATCGGGATAT CACTATGGAG TCTGGTCGTG TGAAGGATGT 600
AAGGCCTTTT TTAAAAGAAG CATTCAAGGA CATAATGATT ATATTTGTCC AGCTACAAAT 660
CAGTGTACAA TCGATAAAAA CCGGCGCAAG AGCTGCCAGG CCTGCCGACT TCGGAAGTGT 720
TACGAAGTGG GAATGGTGAA GTGTGGCTCC CGGAGAGAGA GATGTGGGTA CCGCCTTGTG 780
CGGAGACAGA GAAGTGCCGA CGAGCAGCTG CACTGTGCCG GCAAGGCCAA GAGAAGTGGC 840
GGCCACGCGC CCCGAGTGCG GGAGCTGCTG CTGGACGCCC TGAGCCCCGA GCAGCTAGTG 900
CTCACCCTCC TGGAGGCTGA GCCGCCCCAT GTGCTGATCA GCCGCCCCAG TGCGCCCTTC 960
ACCGAGGCCT CCATGATGAT GTCCCTGACC AAGTTGGCCG ACAAGGAGTT GGTACACATG 1020
ATCAGCTGGG CCAAGAAGAT TCCCGGCTTT GTGGAGCTCA GCCTGTTCGA CCAAGTGCGG 1080
CTCTTGGAGA GCTGTTGGAT GGAGGTGTTA ATGATGGGGC TGATGTGGCG CTCAATTGAC 1140
CACCCCGGCA AGCTCATCTT TGCTCCAGAT CTTGTTCTGG ACAGGGATGA GGGGAAATGC 1200
GTAGAAGGAA TTCTGGAAAT CTTTGACATG CTCCTGGCAA CTACTTCAAG GTTTCGAGAG 1260
TTAAAACTCC AACACAAAGA ATATCTCTGT GTCAAGGCCA TGATCCTGCT CAATTCCAGT 1320
ATGTACCCTC TGGTCACAGC GACCCAGGAT GCTGACAGCA GCCGGAAGCT GGCTCACTTG 1380
CTGAACGCCG TGACCGATGC TTTGGTTTGG GTGATTGCCA AGAGCGGCAT CTCCTCCCAG 1440
CAGCAATCCA TGCGCCTGGC TAACCTCCTG ATGCTCCTGT CCCACGTCAG GCATGCGAGT 1500
AACAAGGGCA TGGAACATCT GCTCAACATG AAGTGCAAAA ATGTGGTCCC AGTGTATGAC 1560
CTGCTGCTGG AGATGCTGAA TGCCCACGTG CTTCGCGGGT GCAAGTCCTC CATCACGGGG 1620
TCCGAGTGCA GCCCGGCAGA GGACAGTAAA AGCAAAGAGG GCTCCCAGAA CCCACAGTCT 1680
CAGTGA 1686
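The relationship between SEQ ID NO:1 and SEQ ID NO:2 can be checked with a short calculation. The Python sketch below translates the start of the coding region from the first 120 nucleotides of the listing and checks that the remaining length is a whole number of codons. The start position (nucleotide 94) is inferred here by matching the translation to the first residues of SEQ ID NO:2; it is not stated in the listing, and the names used are hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: DNA codon -> one-letter amino acid ('*' = stop).
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Nucleotides 1-120 of SEQ ID NO:1 as given in the listing.
SEQ1_HEAD = (
    "CAGCCATTAT ACTTGCCCAC GAATCTTTGA GAACATTATA ATGACCTTTG TGCCTCTTCT "
    "TGCAAGGTGT TTTCTCAGCT GCTATCTCAA GACATGGATA TAAAAAACTC ACCATCTAGC"
).replace(" ", "")

TOTAL_LENGTH = 1686  # stated length of SEQ ID NO:1
ORF_START = 94       # 1-based; inferred by matching SEQ ID NO:2 (assumption)


def translate(dna):
    """Translate whole codons from the start of `dna`."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Residues encoded between the inferred start and nucleotide 120.
n_terminus = translate(SEQ1_HEAD[ORF_START - 1:])

# Nucleotides from the start codon to the end of the sequence.
orf_nt = TOTAL_LENGTH - (ORF_START - 1)
codons, remainder = divmod(orf_nt, 3)
residues = codons - 1  # the final codon (TGA) is a stop
```

Under these assumptions the reading frame runs without remainder to the terminal TGA, and the residue count agrees with the 530 amino acids stated for SEQ ID NO:2.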



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Ile Lys Asn Ser Pro Ser Ser Leu Asn Ser Pro Ser Ser Tyr
1 5 10 15

Asn Cys Ser Gln Ser Ile Leu Pro Leu Glu His Gly Ser Ile Tyr Ile
20 25 30

Pro Ser Ser Tyr Val Asp Ser His His Glu Tyr Pro Ala Met Thr Phe
35 40 45

Tyr Ser Pro Ala Val Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn
50 55 60

Leu Glu Gly Gly Pro Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp
65 70 75 80

Pro Thr Pro Gly His Leu Ser Pro Leu Val Val His Arg Gln Leu Ser
85 90 95

His Leu Tyr Ala Glu Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser
100 105 110

Leu Glu His Thr Leu Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val
115 120 125

Ser Gly Asn Arg Cys Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg
130 135 140

Asp Ala His Phe Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His
145 150 155 160

Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser
165 170 175

Ile Gln Gly His Asn Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr
180 185 190

Ile Asp Lys Asn Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys
195 200 205

Cys Tyr Glu Val Gly Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys
210 215 220

Gly Tyr Arg Leu Val Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His
225 230 235 240

Cys Ala Gly Lys Ala Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg
245 250 255

Glu Leu Leu Leu Asp Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu
260 265 270

Leu Glu Ala Glu Pro Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro
275 280 285

Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys
290 295 300

Glu Leu Val His Met Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val
305 310 315 320

Glu Leu Ser Leu Phe Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met
325 330 335

Glu Val Leu Met Met Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly
340 345 350

Lys Leu Ile Phe Ala Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys
355 360 365

Cys Val Glu Gly Ile Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr
370 375 380

Ser Arg Phe Arg Glu Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val
385 390 395 400

Lys Ala Met Ile Leu Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala
405 410 415

Thr Gln Asp Ala Asp Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala
420 425 430

Val Thr Asp Ala Leu Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser
435 440 445

Gln Gln Gln Ser Met Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His
450 455 460

Val Arg His Ala Ser Asn Lys Gly Met Glu His Leu Leu Asn Met Lys
465 470 475 480

Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn
485 490 495

Ala His Val Leu Arg Gly Cys Lys Ser Ser Ile Thr Gly Ser Glu Cys
500 505 510

Ser Pro Ala Glu Asp Ser Lys Ser Lys Glu Gly Ser Gln Asn Pro Gln
515 520 525

Ser Gln
530

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Thr Phe Tyr Ser Pro Ala Val Met Asn Tyr Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Gly Tyr Ser Pro Ala Val Met Asn Tyr Ser 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGAAGCTTGT CGACCATCAT GACCGGCTAT AGCCCTGCTG TGATG 45

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGATCTAGAG TCGACGCGTC ACTGAGACTG AGGGTTCTGG 40

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
Met Thr Gly Tyr 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CCATCATGAC CGGCTAT

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CTGGTAAGTT TAGTCTTTTT GTC

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCTTCACACC AAGGACTCTT TTGAG

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTTGGCCACA ACACATTTGG GCTTGT

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
CCATCCTAAT ACGACTCACT ATAGGGC

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ACTCACTATA GGGCTCGAGC GGC